Intelligent Rollups in Multidimensional OLAP Data

Authors

  • Gayatri Sathe
  • Sunita Sarawagi
Abstract

In this paper we propose a new operator for advanced exploration of large multidimensional databases. The proposed operator automatically generalizes from a specific problem case in detailed data and returns the broadest context in which the problem occurs. Such functionality is useful to an analyst who, after observing a problem case (say, a drop in sales for a product in a store), would like to find the exact scope of the problem. With existing tools, the analyst has to search manually around the problem tuple and try to discern a pattern, a process that is both tedious and imprecise. Our operator automates these manual steps and returns, in a single step, a compact and easy-to-interpret summary of all possible maximal generalizations along the various roll-up paths around the case. We present a flexible cost-based framework that can generalize various kinds of behaviour (not simply drops) while requiring little additional customization from the user. We design an algorithm that works efficiently on large multidimensional hierarchical data cubes, so that it is usable in an interactive setting.
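As a rough illustration of what a "maximal generalization along roll-up paths" could look like, the sketch below starts from one problem cell and finds the most general combinations of hierarchy levels at which every detailed cell still shows a drop. It is not the authors' cost-based algorithm: it substitutes a naive fixed-threshold test, and the hierarchies, data values, threshold, and all function names are hypothetical and chosen only for illustration.

```python
from itertools import product as cartesian

# Hypothetical toy hierarchies: each detailed value maps to its roll-up path,
# listed from most specific (level 0) to most general ("ALL").
PRODUCT_PATH = {
    "cola":  ("cola", "drinks", "ALL"),
    "juice": ("juice", "drinks", "ALL"),
    "chips": ("chips", "snacks", "ALL"),
}
STORE_PATH = {
    "s1": ("s1", "west", "ALL"),
    "s2": ("s2", "west", "ALL"),
    "s3": ("s3", "east", "ALL"),
}

# Hypothetical detailed data: (product, store) -> relative change in sales.
CHANGE = {
    ("cola", "s1"): -0.30, ("cola", "s2"): -0.25, ("cola", "s3"): 0.05,
    ("juice", "s1"): -0.20, ("juice", "s2"): -0.35, ("juice", "s3"): 0.10,
    ("chips", "s1"): 0.02, ("chips", "s2"): 0.04, ("chips", "s3"): 0.01,
}


def region(case, levels):
    """Detailed cells that share the case's ancestors at the given levels."""
    (p, s), (lp, ls) = case, levels
    return [
        (q, t) for (q, t) in CHANGE
        if PRODUCT_PATH[q][lp] == PRODUCT_PATH[p][lp]
        and STORE_PATH[t][ls] == STORE_PATH[s][ls]
    ]


def holds(case, levels, threshold=-0.1):
    """Naive stand-in test: every detailed cell in the region shows a drop."""
    return all(CHANGE[c] <= threshold for c in region(case, levels))


def maximal_generalizations(case, threshold=-0.1):
    """Level pairs where the drop holds and no more general pair also holds."""
    n_p = len(PRODUCT_PATH[case[0]])
    n_s = len(STORE_PATH[case[1]])
    valid = {
        lv for lv in cartesian(range(n_p), range(n_s))
        if holds(case, lv, threshold)
    }
    return sorted(
        lv for lv in valid
        if not any(o != lv and o[0] >= lv[0] and o[1] >= lv[1] for o in valid)
    )


def describe(case, levels):
    """Name the generalized region, e.g. ('drinks', 'west')."""
    return (PRODUCT_PATH[case[0]][levels[0]], STORE_PATH[case[1]][levels[1]])


case = ("cola", "s1")
for lv in maximal_generalizations(case):
    print(describe(case, lv))
```

In this toy data the drop observed for cola in store s1 generalizes to all drinks in the west region but no further, since store s3 and the snacks category show no drop. A real implementation would replace the all-or-nothing threshold with a cost model that tolerates exceptions, as the abstract's cost-based framework suggests.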


Similar resources

Integrating Semantics within Compressed OLAP Views in the Hand-OLAP System (Extended Abstract)

In this paper, we provide further extensions of Hand-OLAP, a Java-based distributed system for enabling OLAP in mobile environments via intelligent data cube compression approaches. These extensions aim at integrating innovative semantics representation and management models within compressed OLAP views, in order to improve the data cube compression process itself, and to support an improved su...


Using OLAP and Data Mining for Content Planning in Natural Language Generation

We present a new approach to content determination and content organization in the context of natural language generation for quantitative database summaries. Three key properties make our work innovative and interesting: (1) we developed a new text planning approach to deal with the content organization of a data set into a summary report, for example a Data Mining discovery; (2) the approach...


Content aggregation in natural language hypertext summarization of OLAP and Data Mining Discoveries

We present a new approach to paratactic content aggregation in the context of generating hypertext summaries of OLAP and data mining discoveries. Two key properties make this approach innovative and interesting: (1) it encapsulates aggregation inside the sentence planning component, and (2) it relies on a domain independent algorithm working on a data structure that abstracts from lexical and s...


QB4OLAP: A New Vocabulary for OLAP Cubes on the Semantic Web

On-Line Analytical Processing (OLAP) tools allow querying large multidimensional databases called data warehouses (DW). OLAP-style data analysis over the semantic web (SW) is gaining momentum, and thus SW technologies will be needed to model, manipulate, and share multidimensional data. To achieve this, the definition of a precise vocabulary that adequately represents OLAP data on the SW is req...


Representing Temporal Data in Non-Temporal OLAP Systems

Multidimensional data warehouses and OLAP systems do not provide adequate means for dealing with changes in dimension data, changes that appear frequently in dynamic application areas such as current business systems. As data warehouses and OLAP tools serve as decision support systems, they have to reflect such changes. Temporal data warehouses propose sophisticated modelling tools for covering any cha...


Publication date: 2001